Quick
Guide for VietSpider
Step-by-step Guide to VietSpider Running and
Maintaining
XML Vietspider
running and maintaining for Windows version allows administrators to
crawl web data, index data and publish result for your own business
or organizations. This quick guide explains how to use Create a
crawled channel to take data from an exist URL.
How to get web data from a website.
First look of Vietspider
Administrators follow these steps to get web data
step-by-step:
Select icon
to Create a new channel. The interface of Create New Channel as
shown below:
Go to a website, take example,
this case we use Amazon Kindle
Copy the link of Kindle on Address
bar, it looks like this:
http://www.amazon.com/Kindle-Wireless-Reader-Wifi-Graphite/dp/B003DZ1Y8Q/ref=amb_link_355368562_2?pf_rd_m=ATVPDKIKX0DER&pf_rd_s=center-1&pf_rd_r=07YFZNAJCGRFF6CXDSNQ&pf_rd_t=101&pf_rd_p=1289229502&pf_rd_i=507846
And then, paste into Sample
Page on Channel Tab of VietSpider:
When you click on icon
The
following interface will appear.
On browse web interface, you
select text (1) from title of
Kindle, Vietspider will automatically detect (2)
which tag belong to on Tree tag on the right. Then, you right
click and select Add block (3).
Finally, you got position of title (4).
After finishing, click on
to finish. We will get back to Create New Channel interface.
Please hit
as following to select exact data to get.
If you click, you will see the
interface as bellow:
Please write field that you
want to get.
In this field we type: Product-name,
Product-price to put Title and price of Kindle.
Select Product-price and
choose Content [0] on Tree Tag , then hit Select
Block, finally we got position for Product-price. As same
with Product-name. Click Finish to save
configuration.
Alright, almost done.
Click on
icon
to check correct configuration of this channel.
And, now we got data look like this:
If nothing wrong, please hit
Back, and click on
icon in the previous window to Save all information of this channel.
Now go to Tools, open
Crawler by clicking on
,
the interface will be displayed as bellow:
Please click on
to
add channels from channel list.
You will see the interface like
this:
On Group field, select
XML, choose Product in Category and take
Amazon com in Source, then click Add Sources
to add to Crawled list.
Then click on
to start crawling data.
The interface when crawling :
Alright, data now in database,
please hit
in main interface to browse what we got.
Many product on our database,
click on a product to view detail:
Get back to main interface.
That comes to the end of this quick
guide for Vietspider. If you have and question related to this,
please feel free to send email to nhudinhthuan@yahoo.com
.